CDS
Accession Number | TCMCG078C23207 |
gbkey | CDS |
Protein Id | KAG0491842.1 |
Location | complement(join(22362830..22362958,22363043..22363361,22363482..22363687,22363806..22363907,22364010..22364120,22364204..22364313,22364411..22364520,22364608..22364777,22365815..22365979,22366064..22366182,22366303..22366390,22366499..22366604,22366736..22366824,22367208..22367351,22367468..22367560,22367664..22367730,22367842..22367954,22368072..22368167,22368264..22368434)) |
Organism | Vanilla planifolia |
locus_tag | HPP92_005240 |
Protein
Length | 835aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA633886, BioSample:SAMN14973820 |
db_source | JADCNL010000002.1 |
Definition | hypothetical protein HPP92_005240 [Vanilla planifolia] |
Locus_tag | HPP92_005240 |
EGGNOG-MAPPER Annotation
COG_category | G |
Description | beta-galactosidase |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction | - |
KEGG_rclass | - |
BRITE | - |
KEGG_ko | - |
EC | - |
KEGG_Pathway | - |
GOs |
GO:0003674
[VIEW IN EMBL-EBI] GO:0003824 [VIEW IN EMBL-EBI] GO:0004553 [VIEW IN EMBL-EBI] GO:0004565 [VIEW IN EMBL-EBI] GO:0005575 [VIEW IN EMBL-EBI] GO:0005618 [VIEW IN EMBL-EBI] GO:0005622 [VIEW IN EMBL-EBI] GO:0005623 [VIEW IN EMBL-EBI] GO:0005737 [VIEW IN EMBL-EBI] GO:0005773 [VIEW IN EMBL-EBI] GO:0015925 [VIEW IN EMBL-EBI] GO:0016787 [VIEW IN EMBL-EBI] GO:0016798 [VIEW IN EMBL-EBI] GO:0030312 [VIEW IN EMBL-EBI] GO:0043226 [VIEW IN EMBL-EBI] GO:0043227 [VIEW IN EMBL-EBI] GO:0043229 [VIEW IN EMBL-EBI] GO:0043231 [VIEW IN EMBL-EBI] GO:0044424 [VIEW IN EMBL-EBI] GO:0044444 [VIEW IN EMBL-EBI] GO:0044464 [VIEW IN EMBL-EBI] GO:0071944 [VIEW IN EMBL-EBI] |
Sequence
CDS: ATGCCACTAATGGCGCTCCAGTTGGCTCTAACGGCGTGGGCTCTGCTTTCGTCGCGACTGCTAACTCCAGTTGATGCCTCGGTTACTTACGACCGCAAGGCAGTGATCATCAACGGGCAGAGGAGGATTCTCATATCGGGCTCTATCCACTATCCGAGGAGCACTCCGGATATGTGGCCGGACCTTATCCAGAAGGCGAAAGATGGGGGCTTGGATGTCATTCAGACCTATGTATTCTGGAATGGACACGAGCCTTCTCCGGGACAGTACTACTTTGGGGGAAGGTATGATCTTGTTCAGTTTATCAAGTTGGTGAAGCAGGCCGGACTTTATGTTCATCTCCGCATTGGTCCCTATGTTTGTGCTGAATGGAATTTCGGGGGATTTCCTGTCTGGCTAAAATATGTTCCTGGTGTCAGTTTCAGAACAGACAACGGGCCTTTCAAGGCGGCCATGCAAAAATTTACAGAGAAGATTGTAAACATGATGAAATCAGAAGGTTTATTCGAATGGCAAGGTGGTCCTATCATCCTCTCTCAGATAGAGAATGAGCTCGGGCCAGTTGAGTATGATGATGGTGAGCCTGTAAAAGCTTATGGCGTCTGGGCTGCTAAAATGGCAGTTGGCCTGAACACCGGTGTTCCGTGGGTTATGTGCAAGCAAGATGATGCTCCAGATCCAATTATTAACACCTGCAATGGTTTTTACTGTGATTACTTCTCCCCAAATAGGCCTTACAAGCCTACTATGTGGACTGAAGCTTGGACCGCATGGTTCACAGGGTTTGGTGGTGCAGTTCCTCACCGGCCTGTTCAAGATTTGGCTTTTGCTGTTGCAAGGTTTATTCAGAAAGGTGGATCCTTTATCAACTATTATATGTACCATGGAGGGACCAACTTTGGTAGGACAGCTGGTGGCCCCTTCATTGCAACCAGCTATGACTATGATGCTCCAATTGATGAATATGGCTTACTAAGGGAACCAAAATGGGGGCATTTGAGAGACTTACATAGAGCAATTAAGTTGTGCGAACCTGCTCTTGTTTCTAGCGATCCTATAGTATCATCACTGGGCCAATCTCAACAGTCTCATGTCTTCAGAACAAGTTCAGGGGCATGTGCTGCTTTCCTGGCTAACTATGACTCTGGGTCTTATGCAACAGTAACTTTCAATGGAATGCACTACAATCTTCCTCCTTGGTCCATCAGCATCCTTCCTGATTGCAAAACCACAGTTTACAATACTGCAAAGGTAGGTGCTCAGTCTTCATTGATGAAGATGACTTGGTTGGGAAGCTTTTCGTGGCAATCATTCAATGAGGAGACTAACTCTCTGGATGATAGTTCGTTTACAAAACTTGGATTGTATGAGCAATTAAGTCTAACATGGGACAAATCAGACTACCTTTGGTATACAACATACGTCAACATAGGCCAAGATGAGCAATTCTTGAAGACAGGCAATTACCCTGTCCTTACAGTCTTATCTGCTGGCCATGCTTTACATGTTTTTGTCAATGGACAATATGCAGGAAATGCATATGGTGGTGTTGATGACCCAAGACTAACATATACTGGAAATTTAAAGATGTGGGCTGGCAGCAATAAGATATCTATACTAAGCGTTGCTGTTGGTTTGCCTAATGTGGGAAATCATTTTGAGACTTGGAATGCTGGAATTCTTGGTCCAGTAACTCTGAGTGGCCTTAATGAAGGAAAAAGAGACCTTTCGCATCAACAATGGACTTACCAGGTTGGTATGAAAGGTGAACACTTGAGCCTTCATTCACTTGATGGAAGTTCCTCGGTTGAATGGGGAGATGTGTTTCCACATCAACCTTTGACATGGTTCAAGACTTTCTTCAATGCTCCTGATGGCAATGAGCCATTGGCTCTAGATATGAGTAGCATGGGAAAAGGGCAGATTTGGATAAATGGTGAAAGTATTGGCCGCTACTGGCCCGCTTACAAAGCCTCTGGTTCATGTGGCCCTTGCGATTATCGTGGCACATATGATGAAAATAAATGCCGAAGTAACTGTGGTGACTCCTCTCAAAGATGGTATCATGTTCCACGGTCGTGGCTGAAGCAAACGGGGAATTTGTTGGTCGTGTTCGAGGAGTGGGGAGGTGATCCAACAAAGATCTCTATGGTAAAGAGGACCCTTGAGAGTGTTTGCGCTGAAATTGCCGAGTGGCAGCCGGCGGTGGATAATTGGCGTACTAAGAAATATGGAAGACCCAAAGCTCACCTTTCTTGTCCTGCTGGTCACAAGATAAGTACCATAAAGTTTGCAAGCTTTGGAACTCCGCAAGGAGGGTGTGGCAGCTACTCAGAGGGCGTCTGCCATGCTCACAGATCTTATGATGCTTTGGAGAGGGATAATGATATGTTGCAGAACTGTGTTGGGCAGCAAGCATGCTCTGTAGCAGTTGCTCCAGAAGTTTTTGGTGGAGACCCATGCCCTGGAAATATGAAAACTCTTGCTGTTGAGGCAATTTGTCAATAA |
Protein: MPLMALQLALTAWALLSSRLLTPVDASVTYDRKAVIINGQRRILISGSIHYPRSTPDMWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGQYYFGGRYDLVQFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTDNGPFKAAMQKFTEKIVNMMKSEGLFEWQGGPIILSQIENELGPVEYDDGEPVKAYGVWAAKMAVGLNTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNRPYKPTMWTEAWTAWFTGFGGAVPHRPVQDLAFAVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREPKWGHLRDLHRAIKLCEPALVSSDPIVSSLGQSQQSHVFRTSSGACAAFLANYDSGSYATVTFNGMHYNLPPWSISILPDCKTTVYNTAKVGAQSSLMKMTWLGSFSWQSFNEETNSLDDSSFTKLGLYEQLSLTWDKSDYLWYTTYVNIGQDEQFLKTGNYPVLTVLSAGHALHVFVNGQYAGNAYGGVDDPRLTYTGNLKMWAGSNKISILSVAVGLPNVGNHFETWNAGILGPVTLSGLNEGKRDLSHQQWTYQVGMKGEHLSLHSLDGSSSVEWGDVFPHQPLTWFKTFFNAPDGNEPLALDMSSMGKGQIWINGESIGRYWPAYKASGSCGPCDYRGTYDENKCRSNCGDSSQRWYHVPRSWLKQTGNLLVVFEEWGGDPTKISMVKRTLESVCAEIAEWQPAVDNWRTKKYGRPKAHLSCPAGHKISTIKFASFGTPQGGCGSYSEGVCHAHRSYDALERDNDMLQNCVGQQACSVAVAPEVFGGDPCPGNMKTLAVEAICQ |